A Comparative Study on Wavelet Packet Based Front-end in Connected Mandarin Digit Recognition

نویسندگان

  • Xiu-Ping Wang
  • Chuan-Qi Zhu
  • Zong-Ge Li
چکیده

This paper investigates the wavelet packet based front-ends for the connected mandarin digit recognition task. Firstly an ERBlike wavelet packet basis is proposed. Then two kinds of wavelets are selected for comparison. One is the Vaidyanathan wavelet, which has good frequency selectivity but big shift variance. The other is the reverse biorthogonal spline wavelet with excellent shift invariant property. Thirdly, the TeagerKaiser energy operator (TEO) based subband cepstral (TC) feature parameters are extracted from the wavelet packet derived multi-frequency channels. The recognition results of the new front-ends are tested and compared with the popular MFCC parameter on the 8K 16-bit speaker-independent mandarin connected digit corpora. Apart from clean data condition, the performances of the new front-ends are further compared in various noisy conditions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combined Feature Extraction Techniques and Naive Bayes Classifier for Speech Recognition

Speech processing and consequent recognition are important areas of Digital Signal Processing since speech allows people to communicate more natu-rally and efficiently. In this work, a speech recognition system is developed for re-cognizing digits in Malayalam. For recognizing speech, features are to be ex-tracted from speech and hence feature extraction method plays an important role in speech...

متن کامل

Noise Suppression Based on Teager Energy Operator for Improving the Robustness of Asr Front-end

In this paper, we proposed a new noise suppression method based on Teager Energy Operator in advancing the noise robustness of speech recognition front-end. The presented method attempts to remove a distortion estimation in Teager energy domain, especially, a Teager energy estimation of noise signal is subtracted from the noisy speech signal. This approach differs significantly from the traditi...

متن کامل

Automatic speech recognition in Mandarin for embedded platforms

In this paper, we describe a real-time automatic speech recognition system for Mandarin for low-cost embedded platforms using fixed-point digital signal processors. The hands-free, speaker-independent speech recognition system employs 41 mono-phone models for representing the sounds in Mandarin Chinese and 11 whole-word models for connected digit recognition. The system achieves greater than 98...

متن کامل

Duration Modeling in Mandarin Connected Digit Recognition

Digit string recognition is required in many applications which need to recognize numbers such as telephone numbers, credit card numbers, date, etc. In order to design a high performance recognizer, duration information is explored in this study. In a Mandarin connected digit recognizer, insertion and deletion errors amount to more than two thirds of the total recognition errors because there e...

متن کامل

An Efficient Method for Removing Deletion Errors in Quickly-spoken Connected Mandarin Digit String Speech Recognition

Connected Mandarin digit string speech, especially at rapid spoken rate, is very difficult to recognize correctly. In this paper, a new training method named neighboring digits pattern is proposed in order to eliminate most of deletion errors which frequently occur in Mandarin digits speech recognition at high speaking rate when we have enough quickly-spoken speech data as the training set. The...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002